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DIFFERENCE SETS AND POLYNOMIALS 


NEIL LYALL ALEX RICE 


Abstract. We provide upper bounds on the largest subsets of {1, 2,..., N} with no differences of the form 
hi{ni) + • • • + h(_{ni) with G N or h\ (pi) + • • • + with pi prime, where hi G Z[ic] lie in in the classes 

of so-called intersective and "P-intersective polynomials, respectively. For example, we show that a subset of 
{1,2,..., N} free of nonzero differences of the form for fixed € N has density at most 

for some p, = p(j, k) > 0. Our results, obtained by adapting two Fourier analytic, circle method-driven 
strategies, either recover or improve upon all previous results for a single polynomial. 


1. Introduction 


1.1. Background. Lovasz posed the following question: If A C N contains no pair of distinct elements 
which differ by a perfect square, must it be the case that 

N—¥oo N 

Here and throughout we use [1, N] to denote {1,2,..., N}. Erdos posed the analogous question with “perfect 
square” replaced by “one less than a prime number”. Furstenberg [3] answered the former question in 
the affirmative via ergodic theory, specifically his correspondence principle, but obtained no quantitative 
information on the rate at which the density must decay. Independently, Sarkozy [231124) showed via Fourier 
analysis, specifically a density increment argument driven by the Hardy-Littlewood circle method, that if 
^ Q [Ijy contains no nonzero square differences, then 

/(log logA)^\i/3 


( 1 ) 


1^1 
^ < , 
N V 


log A 

while ii a — a' p — 1 (or p + 1) for all a,a' G A and all primes p, then 


( 2 ) 


1^1 (log log log Nf log log log log N 
N (loglogA)2 


We use “<C” to denote “less than a constant times”, with subscripts indicating what parameters, if any, the 
implied constant depends on. 


1.2. Improvements and extensions. Using a more intricate Fourier analytic argument, Pintz, Steiger, 
and Szemeredi improved © to 

(3) M (log7V)-“i°si°si°gi°s^, 

with c = 1/12, and by incorporating more delicate analytic number theory results into Sarkozy’s original 
method, Ruzsa and Sanders [22] dramatically improved © to 


where c > 0 is an absolute constant. 


lA p-UlogAT)!/" 
N ^ 


A natural generalization of Lovasz’s question is to extend from perfect squares to the image of more general 
polynomials. Balog, Pelikan, Pintz, and Szemeredi [1] extended © to sets with no power differences for 
a fixed fc G N, with c = 1/4 and the implied constant depending on k. 
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More generally, to hope for such a result for a given polynomial h G Z[a;], it is clearly necessary that h has 
a root modulo q for every g G N, as otherwise there is a set qN with positive density and no differences 
in the image of h. It follows from a theorem of Kamae and Mendes France [5] that this condition is also 
sufficient, in a qualitative sense, and in this case we say that h is an intersective polynomial. Equivalently, a 
polynomial is intersective if it has a p-adic integer root for every prime p. Intersective polynomials include 
any polynomial with an integer root and any polynomial with two rational roots with coprime denominators, 
but there are also intersective polynomials with no rational roots, such as — 19) (a;^ + a; + 1). 

It is a theorem of Lucier m, with minor improvements exhibited by the first author and Magyar m and 
the second author [18], that if ft, G Z[a:] is an intersective polynomial of degree k > 2 and AC [ 1 , has no 
nonzero differences in the image of ft, then 

|A| ^loglog 

"/V \1ogN~) 

Further, Hamel and the authors [5] extended to all intersective polynomials of degree two, for any 
c < 1/ log(3) and the implied constant depending on c and the polynomial. 

To hybridize the aforementioned results, one could ask for a density bound on a set free of differences of 
the form ft(p), for a fixed ft G Z[a;] and p prime, but this requires further restrictions on the polynomial. 
Specifically, for every 9 G N, there must exist r G Z with (r, g) = 1 and q \ h{r), as otherwise ft(p) is divisible 
by q for only finitely many primes p, and hence mqN has no differences of the form ft(p) for sufficiently large 
m. If this condition is satisfied we say that ft is a V-intersective polynomial. Equivalently, a polynomial is 
P-intersective if for every prime p it has a p-adic integer root that does not reduce to 0 modulo p. Examples 
include any polynomial with a root at 1 or — 1 , and any polynomial with rational roots a/b and c/d satisfying 
{ab, cd) = 1 , while [x^ — 19)(x^ + a: + 1 ) again serves as an example free of rational roots. 

The second author m showed that if ft G Z[a;] is V-intersective of degree k > 2 and e > 0, then a set 
Ac [ 1 , with no nonzero differences of the form ft(p) satisfies 

^ (logAf)-5^^+% 

and further ([3]) holds with c = (21og(3))“^ — e if fc = 2. 

Here we have only alluded to the best-known results in each case, all established through versions of the two 
aforementioned Fourier analytic attacks. For intermediate results and alternative proofs, the reader may 
refer to |S], [57], [15], [T5], [TT], [T3], and [5]. 

1.3. Main results. Here we adapt the known Fourier analytic strategies to handle differences of the form 
fti(ni) -b • • • -f he{ne) for a collection of polynomials fti,..., ft^ G Z[a;], as well as incorporate exponential sum 
estimates of Shparlinski [26] previously unused in this context, to establish the following result. 

Theorem 1.1. Suppose till's > 0 are integers and let ^ = £1 -b ^2 + ^ 3 - 

Suppose fti,...,ft^j G Z[a;] are nonzero intersective polynomials, ft^^+i,..., ft^j+f^ G Z[a;] are nonconstant 
monomials, and ft^j+^^+i,..., ft^ G Z[x] are nonmonomials with fti(O) = 0 for i\-\- \ <i <1. 

Let 

^=(E^r'+V 2 + ^ r-^y\ 

i=l i=ii+l2 + l 

where ki = deg(fti) and r^ is the number of nonzero coefficients of hi. 

Suppose further that AC [1, A^j and 


a - a' ^ y^^h^{ni) 

for all distinct pairs a^a' G A and for all ni,..., € N with /ii(ni),..., ^ 0. 
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Then, 


'(log TV)-2113^+<^ 
V logW j 


M 

^hi,...,he,e * 


„-(logAr)'' 


g-c(logAf)i/'‘ 


g-c^/log Af 

where c = c(hi,..., hi) > 0 and /j, = ... ,ki) > 0 . 


if D > 2 and is > 0, for any e > 0 

if D > 2 and is = 0 

if 1<D <2 

if D = 1 and £2 + -^3 > 0 

if D < 1 and £2 + ^3 > 0 

if D < 1 and ^2 = ^3 = 0 


The statement of Theorem 11.11 is admittedly rather obnoxious, and the reader is encouraged to refer to 
Sections o and [T31 for discussions of several digestible and illustrative special cases of the theorem, the 
reasoning for the specified partitioning of the collection of polynomials and its impact on the resulting 
bounds, and the origin and significance of the quantity D. We also establish an analogous result in the 
prime input setting, where the statement is weaker but more straightforward. 

Theorem 1.2. Suppose hi,... ,hi G 'L[x\ are V-intersective polynomials with deg{hi) = ki > 0, and let 

i=l 

If A C [1, iV] and 

e 

a - a' y^ h^ipi) 

i=l 

for all distinct pairs a, a' € A and for all primes pi,... ,pi with hi{pi), ..., hifpi) ^ 0, then 

(log A^) 2 (-D'-i)+'^ if D' > 2 , for any e > 0 

(log N)-!^ ^ ifl<D'<2 

1 

ifD' = l,i>l 

g-c(iog Ar)i/4 if D' < I or i = ki = l 

where c = cihi, ..., hi) > 0 and p = p(ki,... ,ki) >0. 


N 


,e *' 


Remark on constants in Theorems 11.11 and II. ill By utilizing the precise statements of Lemmas IA.31 IA.51 
and IB.21 one can take the constant c in Theorems o and o to be the reciprocal of the maximum over the 
collection of polynomials of the right hand side of the inequality in the conclusion of Lemma IA. 6 I times a 
constant depending only on ki,ki. Further, in the first appearance of the constant p, in each theorem, one 
can take p = 1/2 log(min{fci}) in Theorem ll.il as explicitly shown in Sectionand p = l/41og(min{fci}) 
in Theorem 1 1.2 1 In the second appearance of p in each theorem, one can apply Chen’s [5] explicit bounds on 
the implied constants in Lemma (A.31 and take p = exp(— ki). We also note that at the expense of 
the implied constants in Theorems II.II and II.21 we are free in all cases to assume that the main parameter N 
is sufficiently large with respect to the fixed polynomials hi,... ,hi, so we take this as a perpetual hypothesis 
and refrain from explicitly including it further. 
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1.4. Some special cases. We first note that in the case oi £ = 1, that is to say the previously treated 
cases of a single polynomial, Theorems 11.11 and 11.21 simply recover the previous best-known results, with the 
notable exception of “sparse” polynomials in Theorem ll.il For example, if h{x) = x + 2x^'^ + x^^, then we 
can take D = 3 with £3 > 0, so a set AC [1, A^] free of nonzero differences of the form h{n) satisfies 

(logiV)-H^ 

for any e > 0, whereas previously 1/4 was replaced with 1/30. Moreover, if r = 2, in other words h{x) = 
ax£ + bx^ for some natural numbers j < k and nonzero a, 6 £ Z, then ([3]) holds with c = 1/2 log(fc), a bound 
previously only known for monomials and quadratics. 


In certain cases with £ = 2, Theorem 11.11 provides density bounds superior to any attained in the single 
polynomial case. For example, if a set A C [1, N] lacks nonzero differences of the form hi{m) + h^in)^ where 
£ 1 [x\ are arbitrary intersective quadratic polynomials, including for example sums of two squares, 
then we can take D = 1 with £2 > 0 and .^3 = 0, and therefore 


( 4 ) 


N 




— c-v/log N 


for some c = c(/ii, / 12 ) > 0. This leap in bound quality is not especially surprising in this particular case, as 
the collection of elements of the form hi(m) + h 2 (n) inside of [1, N] has size (much like the p — 1 

case), and is hence far denser than the image of any single nonlinear polynomial. 


What is more notable, perhaps, is that we exhibit bounds of similar quality in certain cases where the 
collection of avoided differences is quite sparse. For example, if A C [1,7V] lacks differences of the form 
m£ + vA for fixed j, fc £ N, then D = 1, £2 > 0, £3 = 0, and 

(5) ^ 

for some c = c{j,k),p = p{j,k) > 0 , despite the fact that the collection of elements of the form 
in [1,7V] has size at most TVj^'^. In fact, the same estimate holds for a set A C [1,7V] free of differences of 
the form hi{m) + h 2 {n) where hi{x) = aiX^ + bix^^ and h 2 {x) = a 2 x£‘^ + 622 ;^^ for nonzero a^, bi G Z and 
natural numbers ji < ki, and c = c(/ii, 7 i 2 ) > 0 . 

As discussed in Section 11.51 the application of the exponential sum estimates utilized to achieve this gain 
for sparse polynomials and monomials requires sieve estimates that impose a limitation on the density 
increment iteration. This limitation results in the factor of two loss with £2 + ^3 > 0 in the exponents of 
certain bounds in Theorem 11.11 For example, if £ = 1 and h{x) = a; -I- -I- x"^, we are better off treating 

h as an arbitrary intersective polynomial of degree 4, resulting in a density bound of about (log7V)“^/^, 
as opposed to a polynomial with three nonzero coefficients, yielding a bound of about (log7V)“^/^. This 
ambiguity in optimal partition where a larger D can yield a better bound is reasonably rare, and only occurs 
when all possible values of D are greater than 2 or all possible values of D are at most 1. 

For some notable examples with £ = 2 in Theorem 11.21 we see that if A C [1,7V] has no differences of the 
form (p — 1)^ + {q— 1)^ with p, q prime, then D = 1 and (O holds with all constants absolute, whereas if A 
lacks differences of the form (p — 1)^ -|- (g — 1)'* with p, q prime, then D = 2 and ([3]) holds with c = 1/4 log(4), 
and of course these powers can be replaced with any pairs of P-intersective polynomials of degree 2 or 4, 
respectively. 

We also note that the specially earmarked case of £ = fci = 1 in Theorem ll.2l is simply the previously studied 
case of h{p) = a(p± 1 ), and we include the necessary tools to recover this result for the sake of completeness. 


1.5. Motivation for D and D' from Gauss sum estimates. In this context, the guiding principle of 
the Hardy-Littlewood circle method is that if /i £ Z[a;], then the Weyl sum 


( 6 ) 


M 


^ ^ g27ri/i(n)a 
n—1 
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is much smaller than the trivial bound M, unless a is well-approximated by a rational number with small 
denominator. 


On a coarse scale, this principle is captured by combining the pigeonhole principle with Weyl’s Inequality 
(see Lemma Fa. 81) . but for a more refined treatment, we must address the following question: If a IS quite 
close to a rational with a quite small denominator, for example smaller than a tiny power of M, can we beat 
the trivial bound at all? 


This question turns out to be quite straightforward, as under these conditions ([HI) has a convenient asymptotic 
formula, and the gain from the trivial bound resides in a local version of the sum, or Gauss sum. Specifically, 
if a is close to ajq with q small, then, up to a small error, the magnitude of (jS]) is at most M times 


(7) 


9-1 

g-1 ^ g2’rili(s)a/g 

s=0 


Moreover, by sieving our initial set of inputs, letting W equal a product of small primes and considering 
only inputs coprime to IT, we can replace ([51) with 

M 

( 8 ) 

n—1 

(n,VF) = l 


in which case the gain from the trivial bound for a near a/q with small q is given roughly by 


(9) 


9-1 

g-1 ^ g2iri/i(s)a/g 

s=0 
(s,9) = l 


In applying the two previously developed Fourier analytic arguments, the resulting density bounds are 
determined by the power 9 = 9{h) such that the magnitude of the relevant Weyl sum for a near a/q with q 
small beats the trivial bound by a factor of q~^ ■ In particular, to run the more intricate method developed 
first in m.o must be at least 1/2. 


Moreover, when considering sums of polynomials, the relevant sum splits, for example 


Ml M 2 

EE g2-7ri(/ii(ni)-|-/i2(n2))Q 

ni n 2 


^ Ml 

'y ^ g27ri/i] 
n—1 


(n)a'j ^ 


M 2 

y ^ g27ri/i2(n)a 
n—1 




so we can add together the corresponding powers 9(hi) and d(/i 2 ). 


In the context of Theorem o we are free, for a given polynomial h G Z[xJ, to choose the better of d?]) and 
with the caveat that when employing the more straightforward of the two methods, choosing m imposes 
an increased limitation on the density increment iteration due to the need to accurately count integers with 
no small prime factors. 


Traditional estimates (see Lemma lA.Sp say that for any h G Z[a;], one can choose ([7]) and take d{h) = 
l/deg(/i). As observed in [I] (see Lemma FA.4ll . if G Z[a;] is a nonconstant monomial then one can choose 
® and take d{h) = 1/2. Finally, if h G Z[a;] has r > 2 nonzero coefficients, then by estimates of Shparlinski 
(see Lemma FA3]) . one can choose dH]) and take 9{h) = 1/r. 


Given a collection of intersective polynomials hi,... ,hi G Z[a:], we choose 9{hi) for 1 <i < I and for a near 
a/q with q small, we can beat the trivial bound on the chosen £-fold Weyl sum by q~^ where 9 = X]i=i ^(^O- 

As previously mentioned, it is this quantity 9 that is the primary determining factor in the eventual density 
bound, with “breaking points” at 0 = 1/2, where the more intricate argument kicks in, and 9 = 1, where 
the more straightforward argument yields particularly good bounds. The quantity D defined in Theorem 
o is simply 1/9, where the reciprocal is taken for aesthetic purposes, and so that D plays the role formerly 
played by the degree of a single polynomial. 
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In the prime input setting of Theorem ll.2l the aforementioned sieve technique does not yield improved Gauss 
sum estimates, so we must stick to traditional gains and set 9{hi) = l/deg(/ii) for 1 < i < £, which explains 
the more straightforwardly defined quantity D' in that theorem. 

Remark on the Generalized Riemann Hypothesis. As previously remarked, the limitations of known sieve 
estimates, as well as our limited knowledge of the distribution of primes in arithmetic progressions, result in 
potentially avoidable losses in Theorems 11.11 and 11.21 respectively. Specifically, if we assume the Generalized 
Riemann Hypothesis, then the factor of 2 can be dropped from the exponents 1/2(D — 1) and 1/2(D' — 1), 
and both appearances of the exponent 1/4 can be changed to 1/2. 


1.6. Lower Bounds and Conjectures. Armed with a collection of 7 elements of Z/65Z, no distinct pair 
of which differ by a mod 65 square, Ruzsa [50] was able to construct a set AC [1, iV] with no nonzero square 
differences satisfying |A| ^ N‘^, where c = (1 + In 7/ln65)/2 Ri 0.7331. Recently, Lewko [TO] made the slight 
improvement to c = (1 + In 12/ln205)/2 Ri 0.7334. 

The finite field analog of the square difference question suggests that c = 3/4 may be a natural limitation to 
Ruzsa’s construction, which could potentially be viewed as evidence toward as the true threshold for 

this problem, while many believe the threshold actually grows faster than for any e > 0 . 

For the p — 1 case, the gap between known upper and lower bounds is even more cavernous. Ruzsa |21j 
constructed a set A C [1, A^] satisfying |A| ^ ]yc/\og\ogN no p — 1 differences, but nothing better in 
this direction is known. Consequently, the full resolutions of even the two original questions, much less the 
various generalizations, are still massively open. 

Remark on generality of Theorems 17.71 and \1.A We note that the necessary intersective condition makes 
perfect sense in a multivariable setting, and analogous results should hold for every intersective integral 
polynomial in several variables, not just diagonal forms. Further, there do exist intersective diagonal forms 
not covered in these theorems. For example, if p is a prime congruent to 1 modulo 90090 that is not the 
sum of two integer cubes (of which there are plenty), then, since p is a sum of two cubes modulo q for every 
G N, a;^ + — p is an intersective polynomial in two variables that cannot be expressed as the sum of two 

single-variable intersective polynomials. 

Acknowledgements and Funding: The authors would like to thank Paul Pollack and Steve Gonek for 
their helpful comments and references. The first author was partially supported by Simons Foundation 
Collaboration Grant for Mathematicians 245792. 


2. Auxiliary Polynomials and Inheritance Propositions 

At some point in the proofs of all cases of Theorems ll.ll and ll.21 we apply a density increment strategy, and 
we need to keep track of the inherited lack of arithmetic structure at each step of the iteration. Specifically, 
if we start with a set free of differences of the form hi{ni) -I- ■ • • -b hi{ni) for polynomials hi ,..., hi, it 
spawns denser sets free of differences that are the sum of elements in new polynomial images. The following 
definitions describe all of the polynomials that we could potentially encounter. 

Remark on notation. In an effort to maintain a bearable aesthetic, we frequently utilize both subscripts and 
superscripts for indexing purposes. Through context and consistency, we hope to avoid any confusion in 
distinguishing between superscript indices and exponents. 


2.1. Auxiliary Polynomials. Suppose hi,...,hi G Z[x] is a collection of intersective polynomials. For 
each 1 < i < £ and each prime p, we fix p-adic integers zf with hi(zf) = 0 , requiring that zf ^ 0 mod p if hi 
is P-intersective. If considering the unrestricted input case of Theorem 11.11 and hi{0) = 0, we take zf = 0 
for all p, and similarly in the prime input case of Theorem 11.21 we take zf = ±1 for all p in the event that 
/i*(±l) = 0. 
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Remark. The definitions which follow certainly depend on the choice of p-adic integer roots, but any choice 
subject to the aforementioned restrictions works equally well for our purposes, and we suppress the depen¬ 
dence on this choice in the coming notation. 

By reducing modulo prime powers and applying the Chinese Remainder Theorem, the choices of zf determine, 
for each natural number d, a unique integer rf £ (—d, 0 ], which consequently satisfies d | hi{rf), and in the 
case that hi is T’-intersective we have {rf,d) = 1. 

We define the function Ai on N by letting Xi{p) = p™’ for each prime p, where rm is the multiplicity of zf 
as a root of hi, and then extending it to be completely multiplicative. Further, we define 

A = Ai o • • • o and Ai = Ai o • • • o Ai_i o A^+i o • • • o A^. 

For each d £ N and 1 < i < £, we define the auxiliary polynomial hf by 

ht{x) = hfirf + dx)/\^{d). 

If p-1 I d for p prime and j £ N, then since rf = zf mod p-l, we see by factoring hi over Zp that all the 
coefficients of hi{rf + dx) are divisible by p-l^b hence each auxiliary polynomial has integer coefficients. We 
also note that if di( 0 ) = 0 , then the number of nonzero coefficients of hf is the same for all d. 


2.2. Inheritance Propositions. It is important to note that the leading coefficients of the auxiliary polyno¬ 
mials grow at least as quickly, up to a constant depending only on hi, as the other coefficients. In particular, 
if bf is the leading coefficient of hf, then for any a; > 0 we have that if bf > 0 , then 


( 10 ) 


{n £ N : 0 < hf{n) < x} A [ 1 , (x/bf)^/^] 




where A denotes the symmetric difference and the analogous observation holds if bf < 0. 


We define these auxiliary polynomials to keep track of the inherited lack of arithmetic structure at each step 
of a density increment iteration. For the unrestricted input setting in Theorem ll.il we define 

{ {h{n) > 0 : n £ N} if d has positive leading coefficient 

{d(n) < 0 : n £ N} if d has negative leading coefficient 

for a nonzero polynomial d £ Z[x]. 

For the prime input setting in Theorem 11.21 given a collection of intersective polynomials hi,... ,h( £ Z[x], 
we let 

Af = {a; £ N : r); -I- dx is prime} 


for each d £ N, and for a nonzero polynomial d £ Z[x] we define 

{ {d(n) > 0 : n £ A(*} if d has positive leading coefficient 

{d(n) < 0 : n £ A(*} if d has negative leading coefficient 

Note that the polynomials do not need to be P-intersective for these latter definitions to make sense, but if 
they are not then some of the sets Af are nearly if not completely empty. 

For any sets A,BC'Z, we use the standard notation A zL B = {a zt b : a G A, d£ B} for the sum and 
difference sets, respectively. The following two propositions make precise the aforementioned inherited lack 
of structure in each case. 

Proposition 2.1. // di,..., d^ £ Z[x] is a collection of intersective polynomials, A C N, 

(A - A) n (/(d?^) + • • ■ + i{hf )) c {Oj, 
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and A' C £ N : a; + X{q)i £ A}, then 

{A' - A') n + • • • + c {0}. 

Proof. Suppose that A C N, A' C {£ £ N : x + X{q)£ £ A}, and 

i 


0 ^ a- a' = J2 {ui) = ^ 


h.(^^X<})<ii _|_ t + Xi{q)din^ 


X. 


(Xi{q)di'^ 


= E- 

2=1 


X{q)Xi{di) 


for some ni,..., £ N, a, a' £ A', with all polynomial terms having the same sign as the corresponding 

leading coefficient. By construction we know that = rf' mod di, so there exists Si G Z such that 

_ ^di _|_ therefore 

i ^ d- ~ 

n / I w \ \ hi{fi''+ di{si + Xi{q)ni)) , 

2_^h^'{s^ + X^{q)ni) = 2_^ --——-= A(g)(a-a). 

2=1 2=1 

Because A' C {.^ £ N : x + X{q)l £ A}, we know that X{q){a — a') G A — A, hence 

{A-A)r^ (l{h<t) + • • • + I{hf)) % { 0 }, 

and the contrapositive is established. □ 


We utilize the following analog of Proposition 12.11 in the proof of Theorem 11.21 
Proposition 2.2. If hi,..., hi G Z[x] is a collection of intersective polynomials, A C N, 

iA-A)n + ■ ■ ■ + vf (hf)) c {0}, 

and Al' C {£ £ N : X + X{q)£ G A}, then 

(yl'-A')n + + C {0 }. 

Proof. The proof is identical to that of Proposition 12.11 with the added observation that if n £ and 

^\,{q)d _ _|_ |.]^gg g _(_ £ Af. □ 


3. Preliminaries for Sarkozy’s Method 

In sections S] and [SJ we apply adapted, streamlined versions of Sarkozy’s [531 HI] original density 
increment method, more closely modeled after m, [S], and m- Here we set the stage with some requisite 
tools and notation. 


3.1. Fourier analysis on Z. We embed our finite sets in Z, on which we utilize the discrete Fourier 
transform. Specifically, for a function F : Z —>■ C with finite support, we define F : T —?> C, where T denotes 
the circle parameterized by the interval [0,1] with 0 and 1 identified, by 

F{a) = ^F(x)e-2’^““. 


Given A^ £ N and a set H C [1,A^] with |A| = 6N, we examine the Fourier analytic behavior of A by 
considering the balanced function, fA, defined by 

/a = 1a - i51[i,a]- 
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3.2. The circle method. We analyze the behavior of Ja using the Hardy-Littlewood circle method, de¬ 
composing the frequency space into two pieces: the points on the circle that are close to rationals with small 
denominator, and those that are not. 


Definition 3.1. Given 7 > 0 and Q > 1, we define, for each g G N and a G [I,?], 


Ma/g(7) 


a G T : 


a 

a - 

q 


< 7 


M,(7)= IJ M,/,(7), 

{a,q) = l 

and 

m;(7) = |Jm,(7) = |Jm„/,(7). 

r\q a=l 

We then define 911 ( 7 , Q)j the major arcs, by 


Q 

an(7,Q) = Um,(7), 

q=l 

and tn( 7 , Q), the minor arcs, by 

m( 7 ,Q)=T\ 911 ( 7 ,Q). 

We note that if 27 ( 5 ^ < 1, then 

(11) M„/,(7)nMb/,(7) = 0 

whenever ajq^hjr and q,r <Q. 


3.3. Preliminary notation. Before delving into the details of the arguments for Theorems II.II and II.21 we 
take the opportunity to define some relevant sets and quantities, depending on polynomials hi,... ,hi G 1i[x\, 
a partition ^ = ^l + £2 + £ 3 , scaling parameters di,... ,di, a parameter ij > 0, and the size of the ambient 
interval N, that will be used in both cases. In all the notation defined below, we suppress all of the 
aforementioned dependence, as the relevant objects will be fixed in context. 


To this end, given intersective polynomials hi,... ,hi G Z[a;], we let ki — deg(/ii), k = (ki,..., ki), k = 
ntifc,, iG = 2l0^ and 

i=l 

For ease of notation when specifying the dependence of constants on all of these polynomials, we use h to 
denote {hi ,..., h(). Further, when working in [1, N] with scaling parameters di,... ,di G N, we define the 
following for 1 < f < £: 

M, = [{NlU\h\f/^'\, 

where hi is the leading coefficient of , and 


H, = 


{n^n-.Q <hf{n) < iV/9£} 
{n G N : -N/9£ < hf{n) < 0} 


if > 0 
if < 0 ’ 


noting that by m we have 


(12) A [1,M,]| 1 . 

For n G N^, we let h{n) = hi^{ni) H --|- hf‘{ne), and we let Z = {n G G Hi, h{n) = 0}. 


In the context of Theorem o in Section m given nonnegative integers £i,£ 2,£3 with -|- £2 + ^3 = £, we 
fix a real number e by setting e = 0 if £2 = ^3 = 0 and letting e > 0 be an arbitrary positive number if 
^2 + ^3 > 0 . If navigating the argument with a particular collection of polynomials in mind and £2 = £3 = 
one can replace e with 0 throughout and any dependence of constants on this parameter should be ignored. 
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Also in Sectional we employ a trick of initially sieving our input values in order to improve exponential sum 
estimates. To this end, given 77 > 0, we let 

w= n 

p<y-(-D' + e) 


where the product is taken over primes. For 1 < i < £ we let Ili = {n G Hi : (n, W) = 1} and 

M. = M. n (>-;)■ 

p<7^-C^^+e) 

Further, we define H = (^Hi x • • • x He-^ x • • • x \ Z and 

ti e 

M = Y[m, []; Mj. 
i=l j=il + l 

By (fT^ we see that \Z\ <Ck Yli= Mi, and further noting the standard estimate 

n (i-j)»ttoEV)-‘, 


(13) 


p<X 


we see that in fact 

(14) \H\ > M/2, 

provided, for example, that t]~^^ 


In the context of Theorem ll.2l in Section [5l the aforementioned sieving does not yield the desired gains, so 
we make the more straightforward definitions H — (^Hi x • • • x Hi^ \ Z and M = riti -^*1 analogously 
noting that 

(15) \H\>M/2. 


4. Sarkozy’s Method: Theorem o FOR D > 2 AND < 1 


For the remainder of this section, we fix intersective polynomials hi,... ,he G 'Z[x\, partitioned into groups 
of size ii,£ 2,£3 > 0 as indicated in Theorem 11.11 Namely, hi,... ,hi^ are arbitrary nonzero intersective 
polynomials, ..., s-''® nonconstant monomials, and h^j+^ 2 + 1 , ... ,hi are nonmonomials with no 

constant term. Throughout the argument, when working in [l,iV], we let Ci = {\0kK)~^ and define 


Q = Q{N) 


if ^2 = 4 = 0 

gCiVTH?TV if£2 + £3>0' 


We deduce Theorem 1 1.1 1 (outside of 1 < Z? < 2) from the following iteration lemma, which states that a set 
deficient in the desired arithmetic structure spawns a new, significantly denser subset of a slightly smaller 
interval with an inherited deficiency in the structure associated to appropriate auxiliary polynomials. 

Lemma 4.1. Suppose A C [l,iV] with |A| = 6N. If {A — A) r\ (^I{hi^) + ••• + I{h'^‘)^ C {0} and 
di,... ,de, 5“^ < Q, then there exist q <Ch,£ and A’ C [1, TV'] with N' ;^h,e +e)(fe+i)TV, 


and 

for some c 


(6 + cS^+^^ tfD>l 

(l + clog-^((5-i))(5 tfD = l,£2+£3>0 
[(l + c)5 i/L> = 1,^2+4 = 0 or Z)< 1 

(A' - A') n + • • • + c {0}, 

c(h, e) > 0 and C = C(k). 
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Proof that Lemma 14.11 implies Theorem 11.11 for D > 2 and D < 1. Throughout this proof, we let C 
and c denote sufficiently large or small positive constants, respectively, which we allow to change from line 
to line, but can depend only on h and e. We use C and c' similarly, but these constants can depend only 
on k. Suppose A C [1,-/V] with \A\ = 5N and 

(A-A)n (/(hi) + ... + /(h,)) c{ 0 }. 


Setting Aq = Nq = N^ = 1, and Sq = 5, Lemmayields, for each m, a set Am ^ [1, with 

|-^m| — S-nd 

(A - A) n (i{hf) + • • • + i{hf)) c {0} 

satisfying 


(16) 




(17) 


and 

(18) 

as long as 
(19) 


r Sm-l + if D > 1 

Sm> hl + cl0g-^'(C-l))'^m-l ifZl = l,4+4>0 

[(l + c)<5™_i ifZl = l ,£2 + 4 = 0orD< 1 

dT < < (c(5)-'=(^+‘=)'", 

if £2 =4=0 

* ’ - I gciviogiv,„ if £2 + 4 > 0 ■ 


If £> > 1 and £ 2=4 = 0 (and hence e = 0), then by (ITT)) we see that the density Sm will surpass 1, and 
hence (IT^ must fail, for m = In particular, by (IT6)) and m we must have (c(5) > N, 

which implies 

/loglog A^n 

* 

as required. 


If Z? > 1 and £ 2+^3 > 0, then we make the same observation for m = CS i+2e)^ hence by (fTOl) and 
(fT^ we must have {cS)~'^^ ' ^ .^^hich implies 


/ log log N 
V log 


l/2(D-l+2e) 


<h,. (logAr)-i/2(^-i+3^). 


Further, if ZI = 1 and £2 + £3 > 0 , then (IT^ must fail for m = Clog*^ {6 ^), which by (IT6l) and (ITSl) yields 
^ hence 

5<h e-0°8^)°'. 

If H = 1, £2 + £3 = 0, then we see that (fTOl) must fail for m = Clog(i5 ^), and by (flBl) and (fTBl) we must 
have ( 0 ( 5 )“*"which implies 

(5<h 


Finally, if Z 1 < 1 and £2 = £3 = 0 , then again (fT^ must fail for m = Clog(5 ^), so by (fT6l) and (fTHl) we must 
have (€(5)“*^*°®^^ ^ > N, and therefore 

(5 <h 

We have now established nontrivial bounds in all cases, and these bounds match the claims in Theorem ll.il 
outside of the range 1 < Z1 < 2. □ 
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The philosophy behind the proof of Lemma 14.11 is that a deficiency in the desired arithmetic structure from 
a set A represents nonrandom behavior, which should be detected in the Fourier analytic behavior of A. 
Specifically, we locate one small denominator q such that /a has concentration around rationals with 
denominator q, then use that information to find a long arithmetic progression on which A has increased 
density. 

Lemma 4.2. Suppose A C [1, N] with |yl| = SN, let iq = cqS for a sufficiently small constant cq = co(h, e) > 
0, and let 7 = /N. If {A - A) D + ••• +/(hf)) C {0}, di,... < Q, and |y4 n 

{N/9,8N/9)\ > 3(5iV/4, then there exists q < such that 

f \fAia)\'^da^hA \s^log-^{6-^)N z/= 1 ,£2 + 4 > 0 

1^2^ )i/Li = 1,4+4 = 0 or £>< 1 


for some C = (^(k). 


Lemma |4 .1 1 follows from Lemma l4.2l and the following standard density increment lemma. 

Lemma 4.3 (Lemma 2.3 in [18], see also [13], [22]). Suppose A C [l,iV] with \A\ = SN. If0<9<l and 

[ \]A{a)\^da>06^N, 

then there exists an arithmetic progression 

P = {x + £q:l<£<L} 

with qL ^ min{0iV, 7 “^} and |4 fl P| > (5(1 + 9I32)L. 


Proof of Lemma 14.11 Suppose A C [1, N], |4| = SN, [A — A) C\ + • • • + ^ {0}, and 

di,...,de,S-^ < Q. If l^n (lV/9,8iV/9)| < SSN/4, then max{|A n [1,1V/9]|, |A n [87V/9,A^]|} > SN/8. In 
other words, A has density at least 9(5/8 on one of these intervals. Otherwise, Lemmas 14.21 and 14.31 apply, so 
in either case, letting ry = coi5, there exists q < and an arithmetic progression 


with qL ^h,£ S^ and 


P = {x + £q:l<i<L} 


(s + cS^+^^ a D>i 

|AnP|/L> i (l + clog-^((5-i))<5 if P = 1,4+4 >0 

[(l + c)5 if £1 = 1,4+4 = 0 or P < 1 

Partitioning P into subprogressions of step size X{q), the pigeonhole principle yields a progression 

P' = {y + eX{q) : 1 < e < N'} C P 

with N' > qL/2X{q) and \A fl P'\/N' > |A fl P\/L. This allows us to define a set A' C [1, by 

A' = {(. & [1, N'] : y + iX{q) G A}, 

which satisfies |A'| = |AnP'| and N' »k,e S^'^'^N/X{q) »h.e S^^'N. Moreover, by Proposition 12.11 
(A - A) n (/(<4 + • • • + J(hf)) C {0} implies (A' - A') n + • • • + C {0}. □ 


Our task for this section is now completely reduced to a proof of Lemma 14.21 
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Proof of Lemma 14.21 Suppose A C [l,iV] with |^| = SN and (A—A)nl(h‘l^) + - ■ • + C {0}. Further, 

let ?] = cqS for an appropriately small cq = co(h, e) > 0 and let Q = 77 “^^+'^). Since h^'{Hi) C [—N/9i, N/9i], 

X] fA{x)fA{x + h{n)) = ^ Ia{x)Ia{x + h{n)) - (5 ^ lA(a;)l[i,Ar](x + h{n)) 

x^Ia x^Ij xGZ 

nG-ff nGH 


l[i.w](a: + ^(n))lA(x)^ l[i_jv](a;)l[i,Ar](a: +/i(n)) 


x^Ta 

nGH 


x^Ta 

nGH 


< (^S^N -2S\An{N/9,8N/9)\yH\. 

Therefore, if fl (iV/9, 8A^/9)| > 3SN/4, then by (IT4t we have 

(20) ^ fA{x)fA{x + h{n)) < -6^NM/4. 

x^Ta 

nGH 

One can easily check using (HID and orthogonality of characters that 

( 21 ) ^ fA{x)fA{x-\-h{n)) = [ \fA{a)\‘^S{a)da-\-OY,{NM/mmMi), 

Jo 

where 


nGH 


Mi Mi 

t d-i 


5,(a) = ^62""“^ S,{a)= Ya and 5(a)=n^*(«) H 

i=i j=ei+i 


n —1 


n—1 

(n,Vl^) = l 


Combining (EUl) and (1^ . we have that if 5 ^ < Q then 


( 22 ) 


|/A(a)nS'(a)|da > S‘^NM/8. 


Letting 7 = 77 + /N, it follows from various exponential sum estimates and observations of Lucier on 

auxiliary polynomials that if di,..., d^, < Q, then for a G Mq( 7 ), q < Q, we have 


(23) 

where 


b{q) = 


|^(a)| <h biq)M, 

if € 2 +4 = 0 


^CA<i)q-yD if£2+4>0’ 
uj{q) is the number of distinct prime factors of q, and C = (^(k). Further, for a G m( 7 , Q) we have 
(24) \S{a)\ < 6M/16, 

provided cq was chosen sufficiently small. Details of these estimates are provided in Appendix El 
From (l24l) and Plancherel’s Identity, we have 

[ \U{a)\^\Sia)\da<S^NM/W, 


/OT(7.Q) 


|/A(a)nS'(a)|da > S^NM/16. 
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which together with yields 
(25) 


r ^ 

|/A(a)pda »h 


From (1^ and (1^ , we have 

(26) _ 

q=l •^M,(7) 

If I? > 1, then the function b{q) satisfies b{qr) > b{r)/q, and we make use of the following proposition. 

Proposition 4.1. For any 7, Q > 0 satisfying 2'yQ'^ < 1 and any function 6 : N —>■ [0, c») satisfying 
b{qr) > b{r)/q, we have 

Q 


max/ \fA{a)\^da>Q(2'^qb{q)\ ^b{r) ( \fA{a)\^da. 

9<Q7m;(7) ^ ^ ^7^1 J^Fi) 


Proof. By (fTTI) we have 
Q 


^ p ^ Q P _ 

\fA{a)\‘^da>J2<lK<l) \fAia)\‘^da 

\=i ^'?<QJm'( 7) dM'(7) 


Q _ p 

\fA{a)\^da 

q=l r\q ''Mrt't) 


Q p _ Q/’’ 

= X! / \fA{o:)\'^da^qrb{qr) 


Q/r 

E- 

9=1 




/M,(7) 


\fAia)fda, 


where the last inequality comes from replacing b{qr) with b{r)/q, and the proposition follows. 
Using the known estimate 


□ 


Q 

^ <c Q log*^ Q (see [IS] ) 

q=l 


for any C > 0, we see that 
(27) 


Q 1 ifi^ = 1,^2 +4 = 0 

Q~^ ^ <?&(?) <k,e < log‘^(5“^) if H = 1, 4 + 4 > 0 

9=1 j-(r>-i+2e) 


which combined with ((26ll and Proposition 14.11 establishes the lemma for Z? > 1. If Z) < 1, then the lemma 

00 

follows from (l26ll and the fact that ^ b{q) converges. □ 

9=1 

5. Sarkozy’s Method with Prime Inputs: Theorem 11.21 for. D' > 2 and H' < 1 


Structurally speaking, the arguments for Theorem 1 1.2 1 are essentially the same as those for Theorem ll.il 
with careful adaptations required to account for our somewhat limited understanding of the distribution of 
primes in arithmetic progressions. 
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5.1. Counting primes in arithmetic progressions. For x,a,q € N, we define 

ilj(x,a,q)= ^ logp, 

p<x 

p=a mod q 

where the sum is taken over primes. The classical estimates on q) come from the famous Siegel-Walfisz 

Theorem, which can be found for example in Corollary 11.19 of m- 

Lemma 5.1 (Siegel-Walfisz Theorem). If q < (logx)^, and {a,q) = 1, then 

fj{x, a, q) = x/<f{q) + 

for some constant c = c{B) > 0. 

Ruzsa and Sanders [22] established asymptotics for Tfi{x,a,q) for certain moduli q beyond the limitations of 
Lemma l5.II by exploiting a dichotomy based on exceptional zeros, or lack thereof, of Dirichlet L-functions. 
In particular, the following result follows from their work. 

Lemma 5.2. For any Q,B > 0, there exist qo < and p G [1/2,1) with (1 — p)~^ <C qo such that 


(28) 


'ip{x,a,q) = 


X x(a)a;^ 


O 


(^x exp ^ 


clogx 


\/logx -I- B'^ log Q 


B- 


log q) , 


where x is o Dirichlet character modulo qo, provided qo \ q, {a,q) = 1, and q < (qoQ)^ 


Lemma [5.21 is a purpose-built special case of Proposition 4.7 of [52], which in the language of that paper 
can be deduced by considering the pair {Q^ Q^), where go is the modulus of the exceptional Dirichlet 

character if the pair is exceptional and go = 1 if the pair is unexceptional. 

It is a calculus exercise to verify that if /3 G [0,1/2] and x > 16, then 1 — x~^/{I — ft) > ft, which implies 
that the main term in Lemma 15.21 satisfies 

(29) 5i(^(a:: - x{a)xP/p)/(fiq)^ > (1 - p)x/(l){q) » x/qo(j){q). 

5.2. Main iteration lemma. For the remainder of this section, we fix nonzero P-intersective polynomials 
hi,..., hi G Z[x]. Unlike in Sectional we also must fix at the outset a natural number N, in order to 
carefully apply estimates on 'tp{x,a,q). 

Specifically, we let Q = e'^Wiog w £qj. sufficiently small constant ci = ci(k) > 0, and we apply Lemma [521 
with B = lOK, letting go < 2^°^, p G [1/2,1), and the Dirichlet character x be as in the conclusion. 

We see that if ci is sufficiently small and X > then 

(30) + 
for all X < X, provided go | g, (a, g) = 1, and g < (goQ)^°^. 

We deduce Theorem 11.21 foutside of 1 < D' < 2) from the following analog of Lemma Td. II 
Lemma 5.3. Suppose A C [1,L] with |A| = 5L and L > \fN. If 

(A - A) n (vtHhf^) + ■■■ + vf (/if)) C {0}, 

go I di and di/qo,6~^ < Q, for 1 < i < £, then there exists q <Ch,£ and A' C [1,L'] with V ^h.e 

(S + cS^'+^^ ifD'>l 

y > Ul + clog-^(5-i))d tfD' = l,l>l 

[ (1 + c)6 if D' < 1 or £ = ki = 1 
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and 


{A' - A') n + • • • + ^ |q| 

for some c = c(h, e) > 0 and C = C'(k). 


Proof that Lemma 15.31 implies Theorem 11.21 for D' > 2 and D' < 1. Suppose A C [1,7V] with 
|A| = 5N and 


a-a' hi{pi) 

i=l 

for all distinct pairs a, a' G d. and for all primes pi, ... ,pi with hi{pi ),..., hi{pi) 0. In particular, this 
implies that 

{A — A)C\ (/i]) H + Vl{h\'^ C {0}. 

Partitioning [1,-/V], the pigeonhole principle guarantees the existence of an arithmetic progression 

P = {x + £X{qo) : 1 < £ < No} C [1, N] 
with No > N/2X{qo) and | A fl P| > SNq. Defining Aq C [1, iVo] by 

^0 = {£ & [1, No] ■ X + £X{qo) G A}, 

we see that |^o| > ^TVq and 

(Ao - ^o) n + • • • + C {0}. 

After this initial passage to a subprogression, Theorem 1 1.2 1 (for D' > 2 and D' < 1) follows from Lemma 15751 
in a manner completely analogous to the deduction of Theorem 11.11 from Lemma 14.11 □ 


We establish Lemma 15.31 from the following analog of Lemma 14.21 


Lemma 5.4. Suppose A C [1,L] with |A| = SL and L > \fN, let rj = cqS for a sufficiently small constant 
Co = co(h,e) > 0, and let 7 = /L. If qo \ di, di/qo,S~^ < Q, 

(A-A)n + • • • + vf (/if)) c {0}, 

and |An (L/9,8L/9)| > SSLjA, then there exists q < such that 

^ r S^' + l+2e]^ if D' > I 

f \fA{a)\^da:>h,e \s^log-^{S-^)L ifD' = l,£>l 

[s^L if D'<l or £ = ki = l 


for some C = C{k). 


The deduction of Lemma 15.31 from Lemma 15.41 is effectively identical to the deduction of Lemma 14.11 from 
Lemma 14.21 and our task for this section is now reduced to a proof of Lemma 15.41 


Proof of Lemma 15.41 The proof of Lemma (5.41 is analogous to that of Lemma [4.21 Suppose AC [1,L] 
with |A| = (5L, L > '/N, 

(A-A)n (vf (/if) + ---+Vf (/if)) C {0}, 

qo I di, and di/qo,6~^ < Q. We make liberal use here of notation defined in Section|3l defining the relevant 
objects in terms of L, the size of the current ambient interval, as opposed to the previously fixed N. 

For 1 < * < .^, we define functions Vi on Z 

jy.(„) = log{d^n + rf )l^di (ii), 

CLi z 

and for n € we let z/(n) = 
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Just as in the derivation of we have by m and orthogonality of characters that 


/ \fA{oi)\^\S{a)\da > I ^ fA{x)fA{x + h{n))v{n) + 

Jo I ^ V minM, J 

nGH 


and hence 

( 31 ) j\u{a)f\S{a)\da>d^Ld>/2 + Ou{^^^^), 


where 


Mi 

n—1 


and 


e 

S{a) = Y[S,{a), 


^ - 


Yl^iidAl. 


' di^' d, 


jdi). 


From o and (ESI), we know that 


(32) 


i 


^' > J|c(l 


p)M, > {Cqo)-^M > 


which combined with (1311) implies 

(33) [ \U{a)\^\S{a)\da>S^L'df/d. 

Jo 

We let r] = cqS for a sufficiently small constant Cq = co(h, e) > 0,Q = T] and 7 = Q/L. It then follows 

from various exponential sum estimates, observations of Lucier on auxiliary polynomials, and Theorem 4.1 
of m that if a G Mq( 7 ), q < Q, then 


(34) 


|5(a)| <h 


4'/</)(g) 

C'‘^('?)g-i/-°'(g/</)(q))^4' 


if £ = fci = 1 
else 


where uj{q) is the number of distinct prime factors of q and C = C(k), and 


(35) |5(a)| < J'I'/S for all a G 1 x 1 ( 7 , Q), 

provided we choose cq sufhciently small. We discuss these estimates in more detail in Appendix [B1 and the 
remainder of the proof is completely analogous to that of Lemma 14.21 □ 


6. Double Iteration Method: Theorem o for 1 < L> < 2 

In this section, we apply an adapted version of a double iteration argument, developed by Pintz, Steiger, 
and Szemeredi m and previously modified and streamlined in [1] and [ 8 ]. A technical difference with the 
previous method is the necessity that both the space and frequency domains be discrete. 
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6.1. Fourier analysis and the circle method on Z/A^Z. We identify subsets of the interval with 

subsets of the hnite group Zjv = Z/iVZ, on which we utilize the normalized discrete Fourier transform. 
Specihcally, for a function F : Z^r —>■ C, we dehne F : Z^r —>■ C by 


F{t) = 


1 ^ 


We make the analogous definitions for the major and minor arcs on Z^v, singling out the zero frequency 
rather than introducing the balanced function. 


Definition 6.1. Given TV G N and K,Q > 0, we define, for each g G N and a G [1, g], 


Ma,,{K) = \t€ZN: 4 -- 
I N q 

We then define 911(iV, Q), the major arcs, by 


< 


K 

N 


I and Mg{K)= (J \ {0}. 


{a,q) = l 


miK,Q)=\jM,{K), 

9=1 


and m{K, Q), the minor arcs, by m{K, Q) = Z^r \ [iXSl{K, Q) U {0}). ft is important to note that as long as 
2KQ‘^ < N, we have that n = 0 whenever alq^hjr, q,r < Q. 


We note that the sets defined above certainly depend on N, despite its absence from the notation. In 
practice, N should always be replaced with the size of the appropriate ambient group, often denoted in the 
intermediate stages of the iterations by L. 


6.2. Overview of the argument. In a manner essentially identical to our attainment of (1261) in Section 
m we begin by observing that if {A — A)r\ + • • • + C {0} for a set ^ C [1, N] and intersective 

polynomials hi,... ,hi G Z[a;], one can apply the circle method and Weyl sum estimates to show that this 
unexpected behavior implies substantial mass of A over nonzero frequencies near rationals with small 
denominator. 

At this point, the traditional method, which we employed in Section |4l is to use the pigeonhole principle 
to conclude that there is one single denominator g such that A has concentration around rationals with 
denominator g. From this information, one can conclude that A has increased density on a long arithmetic 
progression with step size an appropriate multiple of g, leading to a new denser set with an inherited lack of 
structure and continued iteration. 

Pintz, Steiger, and Szemeredi m observed that pigeonholing to obtain a single denominator g is a potentially 
wasteful step. We follow their approach, observing the following dichotomy: 

Case 1. There is a single denominator g such that A has extremely high concentration, 
greater than yielded by the pigeonhole principle, around rationals with denominator g. This 
leads to a very large density increment on a long arithmetic progression. 

Case 2. The mass of A on the major arcs is spread over many denominators. In this 
case, an iteration procedure using the “combinatorics of rational numbers” can be employed 
to build a large collection of frequencies at which A is large, then Plancherel’s identity is 
applied to bound the density of A. 

Philosophically, Case 1 provides more structural information about the original set A than Case 2 does. 
The downside is that the density increment procedure yields a new set and potentially a new polynomial, 
while the iteration in Case 2 leaves these objects fixed. With these cases in mind, we can now outline the 
argument, separated into two distinct phases. 


18 




Phase 1 (The Outer Iteration): Given a set A and intersective polynomials hi,...,hg G 
Z[a;], ordered and partitioned as in SectionlH with (A — A) 0 + • • • + I{hi^ C {0}, 

we ask if the set falls into Case 1 or Case 2 described above. If it falls into Case 2, then we 
proceed to Phase 2. 

If it falls into Case 1, then the density increment procedure yields a new subset Ai of a 
slightly smaller interval with significantly greater density, and 

(Ai - Ai) n + • • ■ + c {0}. 

We can then iterate this process as long as the resulting interval is not too small, and the 
dichotomy holds as long as the coefficients of the corresponding polynomials are not too 
large. We show that if the resulting sets remain in Case I, and the process iterates until the 
interval shrinks down or the coefficients grow to the limit, then the density of the original 
set A must have satisfied a bound stronger than the one purported in Theorem II.II for 
1<D<2. 

Contrapositively, we assume that the original density does not satisfy this stricter bound, 
and we conclude that one of the sets yielded by the density increment procedure must lie 
in a large interval, have no nonzero differences in + • • • + for reasonably small 

di,..., df, and fall into Case 2. We call that set B C [I, L]. 

We now have a set B C [1, L] with {B — B) D + • • • + C {0} which falls into Case 2, so we 

can adapt the strategy of and [8]. It is in this phase that we use that D < 2. 

Phase 2 (The Inner Iteration): We prove that given a frequency s G’Ll with s/L close to 
a rational a/q such that B{s) is large, there are lots of nonzero frequencies t gLl with tjL 
close to rationals b/r such that B{s + t) is almost as large. This intuitively indicates that 
a set P of frequencies associated with large Fourier coefficients can be blown up to a much 
larger set P' of frequencies associated with nearly as large Fourier coefficients. 

The only obstruction to this intuition is the possibility that there are many pairs {a/q, b/r) 
and {a'/q',b'/r') with a/q + h/r = a'/q' + b'/r'. Observations made in [T^ and [1] on the 
combinatorics of rational numbers demonstrate that this potentially harmful phenomenon 
can not occur terribly often. 

Starting with the trivially large Fourier coefficient at 0, this process is applied as long as 
certain parameters are not too large, and the number of iterations is ultimately limited by 
the growth of the divisor function. Once the iteration is exhausted, we use the resulting 
set of large Fourier coefficients and Plancherel’s Identity to get the upper bound on the 
density of B, which is by construction larger than the density of the original set A, claimed 
in Theorem 1 1.1 1 for 1 < D <2. 

6.3. Reduction to two key lemmas. For the remainder of this section, we fix intersective polynomials 
hi,... ,hi G L[x], partitioned as indicated in Theorem 11.11 Namely, hi ,are arbitrary intersective 
polynomials, ..., h^i+^2 s-''® nonconstant monomials, and ... ,hi are nonmonomials with no 

constant term. We let 

D={'Zk-^+i2 /2+ Y. 

i=l i=^i+^2 + l 

where ki = deg{hi) and is the number of nonzero coefficients of hi, and again we let D' = k//^■ 

Further, we let k = (fci ,... ,k(), k = 0^=1 ki, and p = 2“^°^. We also fix a natural number N and an 
arbitrary e > 0 and let Q = (log We deduce Theorem 11.11 (for 1 < < 2) from two key 

lemmas, corresponding to the two phases outlined in the overview, the first of which yields a set with 
substantial Fourier mass distributed over rationals with many small denominators. 
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Lemma 6.1. Suppose A C [1,iV] with |yl| = SN and {A — A)r\ (j{hi) + • • • + C {0}. If 

(36) 

then there exists B C [1, L] and di,... ,di < satisfying L > N^~p, \B\/L = u > 5, and 

(S - B) n (l{h{^) + • • • + I{hf)^ C {0}. 

Further, B satisfies |i? fl [1,L/2]| > aL/3 and 


(37) 


max 

q<Q 


E 

tGM,(e) 


|B(t)|2 <a2(logiV)-l+^ 


The second lemma corresponds to the iteration scheme in which a set of large Fourier coefficients from 
distinct major arcs is blown up in such a way that the relative growth of the size of the set is much greater 
than the relative loss of pointwise mass. 


Lemma 6.2. Suppose B C [1, L\ and di ... ,di are as in the conclusion of Lemma \6.1l let Bi = Br\ [1, L/2], 
let m = 2(mhi^^i kiY' — 1, and suppose a > . If D <2, then given U,V,K G N with max{{7, V, K} < 

and a set 

Pcjte UM,(iF)U{0}:|B^(t)|>^ 

I 9=1 


satisfying 


(38) \P n M(j ,j(iF)| < 1 whenever q <V, 

there exist U', V, K' gN with max{C/', V, K'} <Ck (max{C/, V, and a set 

(39) p'cltG\jM,{K')U{0}:\B[{t)\>^ 

[ 9=1 

satisfying 

(40) \P' r\'M.a^q{K')\ < 1 whenever q<V' 
and 


(41) 


in 

{wy 


>E(iogiV)i-i5T 


6.4. Proof of Theorem 11.11 for 1 < £> < 2. In order to establish Q, we can assume that 

S> (log7V)“'°s^°s^°8'°s^. 

Therefore, Lemma lOl produces a set B of density a > 6 with the stipulated properties, and we set Pq = {0}, 
Uo = 3, and Vq = Kq = 1. Then, if D < 2, Lemma |ffi2] yields, for each n, a set Pn with parameters C/„, Vn, Kn 
such that 

max{C/„,P„,if„} < (iogAr)W 2 )"+^iogiogiogiogJV 

and 

teP„ " 

where the left-hand inequality comes from Plancherel’s Identity, as long as max{t/„, Vji, This 

holds with n = (1 — e)(loglogloglogiV)/log(m/2), as (m/2)"+^ < (logloglogTV)^”'^/^, and desired case of 
Theorem 11.11 follows. □ 
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6.5. The Outer Iteration. We begin the first phase with the following discrete analog of Lemma H751 the 
proof of which is effectively identical, and versions of which can be found in m and [Ill- 

Lemma 6.3. Suppose B C [1,L] with \B\ — aL. IfQ<9<\ and 

E 

then there exists an arithmetie progression 

P = {x + £q : I < £ < L'} 

with qV ^ min{0, K~^}L and \B (1 P\ > a{l + 6*/32)L'. 

Lemma 16.31 and Proposition 12.11 immediately combine to yield the following iteration lemma, corresponding 
to Case 1 discussed in the overview, from which we deduce Lemma l6.II 

Lemma 6.4. Suppose B C [1, L] with \B\ = aL and {B — B) r\ (^I{h'^^) + • • • + C {0}. If 

(42) E 

for some q < Q, then there exists B' C [1,L'] satisfying L' ^ LjQf^^, 

(B'-B')n + C {0}, 

and 

\B'\/L' > a(l + (log Af)-i+732). 

Proof of Lemma 16.11 Setting Ao=A, Nq = N, Sq = <5, and di,...,d^ = 1, we iteratively apply Lemma 
16.41 This yields, for each j, a set Aj C [l,iyj] with \Aj\ = 6jNj and 

{A, - A,) n (/(<7 +... + I{hf )) c {0}, 

satisfying 

(43) fVj > iV/(CQ)('=+l)^ 7 > 7_i(l + (logiV)-i+732), d, < Q!^\ 

where C is an absolute constant, as long as either 

(44) niax E 

or \Aj n [l^Nj/2]\ < 5jNj/3, as the latter condition implies Aj has density at least 37/2 on the interval 
{Nj/2,Nj]. We see that by (IMl) and (|i5)) . the density 6j will exceed 1 after 

641og(<5-7(logAf)^-'= < (logiV)i-'^/2 

steps, hence fails and \Aj fl [l,Nj/2]\ > SjNj/3 for some 

(45) /< (logiV) 1-^/2. 

However, we see that (1551) . (H51) . and (HSl) imply 

so we set B = Aj, L = Nj, a = Sj, and dt = dj for 1 < i < £, and we see further that 

7 < < g(logW)i-'/'‘ < j^p/2^ 

as required. □ 

The task of this section is now reduced to a proof of Lemma 16.21 
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6.6. The Inner Iteration: Proof of Lemma 16.21 Let B C [1,L] and G N be as in the 

conclusion of Lemma 16.11 let Bi= Bn [l,L/2], let m = 2(min{fci})^ — 1, and suppose a = \B\/L > Q 
For 1 < * < let Mi = [(L/3t'|6i|)^/^*J, where hi is the leading coefficient of hfh Letting 


= 


{nGn-.O <hf{n) <L/U} 
{n Gn : -L/U < hf{n) <Q} 


if > 0 
if < 0 ’ 


we note that by (US we have 


(46) A [1,M,]| 1. 

for 1 < i < £. Suppose we have a set P with parameters U, V, K as specified in the hypotheses of Lemma [6^ 
fix an element s G P,\et r] = cocrjU for a sufficiently small constant cq = co(k, e) > 0, let W = np<,)-(D'+o P 
where the product is taken over primes, and let w = np<r)-(D'+o (1 ~ 1/p)- in previous sections, we let 
h{n) = + ••• + Z = {n G : rn G Hi, h{n) = 0}, Hi = {n G Hi : {n,W) = 1}, and 

H=[HiX---xHi,x Hi,+i ■■■xHi^\Z. 


Since (B - B) n (l(h{^) + • • • + )) C {0}, we see that there are no solutions to 

a — b = h{n) mod L, a G B, b G Bi, n G H. 
Combined with (1461) and the orthogonality of the characters, this implies 


B{t)Bi{s + t)T{t) = h'{n)B{x + = Oh((minwMi) ^), 




X^Zl 

nGH 


where 


h'in) = Y[ihfy{ni), 


2 = 1 


.Mi Mi 

T,(t) = y fyt) = — y] 


n—1 

{n,W)^l 


and 


m = i[m) n m), 

i=i j=ei+i 


which immediately yields 


y] \BmBi{s+t)\\T{t)\> Y. mB^{s+t)T{t) 

tGZi\{0} tGZi\{0} 

= .§(0)|:b;(s)|T(0) - Oh((minu;M,)-i). 
Therefore, since |i3i(s)| > cr/U, T(0) > l/(4£)^, and cr~^,U < Q, we have that 


(47) 


y] \B{tm,{s+t)\\Tit)\>-^. 

tGZi\{0} ^ ’ 


It follows from traditional Weyl sum estimates and Lemmas 11 and 28 of m that 


(48) 


\m\ < 


for alH £ m (?7 ^,r] 


{loeyu 

provided we choose cq sufficiently small, and 

(49) |r(t)| «h,. min{l, (L|t/L - a/q\)-^} 

if t £ {a,q) = 1, and q < where io{q) is the number of distinct prime factors of q and 

C = C'(k). Estimates (1481) and (l4^ follow from the same ingredients as (l23t and (l24l) . and are discussed 
further in Appendix 


22 









We have by (liSl) . Cauchy-Schwarz, and Plancherel’s Identity that 


E 






{myu' 


E 


\B{t)\\B,it)\\T{t)\> 


{loeyw 


which together with (j22]l yields 

(50) 

We now wish to assert that we can ignore those frequencies in the major arcs at which the transform of B or 
Bi is particularly small. In order to make this precise, we first need to invoke a weighted version of known 
estimates on the higher moments of Weyl sums. Specifically, it follows from Theorem 1.1 of [52] that 

(51) ^|T(t)r<C, 

where C = C{m). Choosing a constant 0 < ci < ((40£)'^C'^/"‘)“"*/^, where C comes from em, we define 

(52) X = {te 911 ( 77 - 1 ,:min{|i3(t)|,|iT((s + l)|} < 
and 

Using Holder’s Inequality to exploit the higher moment estimate on T, followed by Plancherel’s Identity, we 
see that 


t£X 


t£X 
2/m •^+1 


^|%)||iii(s + t)||r(t)|< (^|%)|^|i?i(s + t)|^) (^|T(i)| 

( E max{|5(l)|M5i(s + t)p|) ™ -Cl 


< 


< 


< 


Cl a rr 

‘ ( 

U 

1 




{myu 


( E + 1^1 ('S + i)p) 




and hence by 0 we have 
(53) 

For i,j,k G N, we define 


{20£yU' 

^\Bit)\\Bys + t)\\Sit)\>-^ 


U' 




[a/q : (a,q) = 1, 2* ^ <q< 2\ ^ < max|.g(t)| < ^ < max|Bi(s + t)| < , 


21 _ ' '“"'-21-1’ 2 '=“ I ' 71 - 2fe-i 

where the maximums are taken over nonzero frequencies t G 'M.a/qiv~^)- We see that we have 

(54) ^ ^ |%)||:B;(s + ^)||T(^)|«|7^,,,,fc|^ max ^ |T(t)|. 

It follows from (l49l) . the bound U,a~^ < Qi/"*, and the standard estimates 

uj{q) < log g/log log ( 7 , q/(j){q) < log log g, 

that if (a, q) = 1 and q < then 

^ |T(t)| «k,. C'-(^)(g/<^(<7))^=+^^g-i/i'log(Q) 




q-^^^i\ogNy 
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hence by (IMl) . and the fact that D <2, we have 

(55) ^ ^ |S(t)||:B;(s + t)||r(t)|«, |7^,^,■fc|^2-*/2(log7V)^ 


By our definitions, the sets exhaust Y by taking 1 < 2® < and 1 < 2^, 2^ < 

a total search space of size <Ck,e (log Q)^. Therefore, by (1551) and (I55|) there exist i,j, k in the above range 
such that 

In other words, we can set 14 = 2®, W4 = 2-1, and Ug = 2^ and take an appropriate nonzero frequency from 
each of the pairwise disjoint major arcs specified by to form a set 


V. 


PsQ\t€ y M,(77 ^) : \Bi{s + t)\> 

q=Vs/2 


a 

TTs 


which satisfies 

(56) 

and 

(57) 


1/2 


IP. I 


UsWgVg 
C/(loglV)2<=’ 


|Ps n Ma,q{r] ^)| < 1 whenever q<Vs, 


max |P(t)| > T 7 W whenever q < Vg and 'Ma/n{r] (1 Pg ^ 0, 

iGM„/,(^-i)\{0}' Wg /9V/ / 


noting by disjointness that a/q € Pi,j,k whenever q < Vg and n Pg ^ 0. 

We now observe that by the pigeonholing there is a subset P C P with |P| »k,e |P|/(log Q)^, and hence 
(58) |P|», |P|/(logiV)4 

for which the triple Ug^Wg^Vg is the same. We call those common parameters U and V, respectively, 
and we can now foreshadow by asserting that the claimed parameters in the conclusion of Lemma 16.21 will 
be U' = U,V' = VV, and K' = K + , which do satisfy the purported bound. 


We let 


77.= '^- + -:sS for some s G P and t G Mf,/r(^ ^) for some t G Pg 


By taking one frequency s + t associated to each element in 77., we form our set P', which immediately 
satisfies conditions (1551) and (1401) from the conclusion of Lemma 16.21 However, the crucial condition (15T1) on 
|P'|, which by construction is equal to |77|, remains to be shown. To this end, we invoke the work on the 
combinatorics of rational numbers found in [13 and [I]. 


Lemma 6.5 (Lemma CR of [T]). 


|P| > 


|P|(min p \Pg\) 


where 


VEt^ (I+ \ogV) 

E = maxllfo : (6,r) = 1, Mb/r(? 7 “^) n y P^ ^ 0| 

r<V I I- _ J 


seP 


T[q) is the divisor function and t = max^^yy p{q)- 


It is a well-known fact of the divisor function that T(n) < for large n, and since rj VV < Q, we 

have that r < (logiV)*^. 


24 








We also have from dSZl) that 



where the last inequality follows from (1571) . and hence 

(59) E <W^{\ogN)-^+\ 


Combining the estimates on r and E with (I56p , (1551) , and Lemma 16.51 we have 


^ (log NY C/2 (log NY- \ ,, ^ 


\P\ U^W^V 

,cr NV rt2nncr /V'l4£ 1 



Recalling that we set U' = U, the lemma follows. 


□ 


Remark on Theorem \1.2\ for 1 < D' <2. To avoid excessive redundancy, we omit the details of the double 
iteration method with prime inputs, which establishes the bounds in Theorem 11.21 for 1 < D' < 2. All of 
the tools required to adapt the argument from unrestricted inputs to prime inputs are already on display in 
Section [5Js adaptation of Sarkozy’s method. For a detailed treatment of the double iteration method with 


prime inputs in the single polynomial case, the interested reader may refer to Chapter 11 of [18) . 

Appendix A. Exponential Sum Estimates 

In this appendix, we either invoke or prove all exponential sum estimates necessary to establish the crucial 
major and minor arc upper bounds in Sections ID and E) namely (|23|) . (l24)l . (l48ll . and (|49ll . The first two 
lemmas provide asymptotic formulae for the relevant Weyl sums near rationale with small denominator. 

Lemma A.l. Suppose g{x) = oq + aix + • • • + OjX^ S A > 1, and let J = \aY + ■■■ + \aj\. //a, g S N 
and a = a/q + Y, then 



Proof. We begin by noting that for any a, g G N and x > 0, 


(60) 



n=s mod q 


Since 


(61) 


#{1 < n < X : n = s mod g} = x/q + 0(1). 


Using deo]) and successive applications of summation and integration by parts, we have that if a = a/g + /I, 
then 




□ 


as required. 
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Lemma A.2. Suppose g{x) = ao + aix + - • -^-ajX^ G A > 1, VL = Y\p.^YP’ = |ao| + - • • + |aj|- 

// a, <7 G N and a = a/q + P, then 




n—1 

(n,VF) = l 


p<Y ^ s=0 

Pt9 ((s,(j),W^)=l 


Proof. Lemma IA.2I follows by simply mimicking the proof of Lemma lA.li replacing (ICTl) with the fact that 
if {{s,q),W) = 1, then 

#{1 < n < X : n = s mod q, (n, W) = 1} = — -J ttsfw ) j ^ 


p<Y 

p\g 


which follows from Theorem 7.2 in [7] as exhibited in [T], whereas otherwise this set is empty. 


□ 


As indicated by the asymptotic formulae in Lemmas lA.il and I A. 21 we can beat the trivial bound on the Weyl 
sums near rationals with small denominator by invoking nontrivial estimates on the Gauss sums, which are 
provided by the next three lemmas. 

Lemma A.3 (Lemma 6, [13]). If g{x) = oq + aix + • • • + ajx^ G Z[a;], j > 1, then 


9-1 

IE- 

s=0 


,27Tig{s)/q 


gcd{cont{g),qy^^q^ 


where 


cont(gf) = gcd(ai,.. .,%■). 
Lemma A.4 (Lemma 2, [I]). If {a,q) = 1, then 


9-1 

E ' 


^27t[s^ a/q 




where C = C{j). 

Lemma A.5 (Theorem 1, [26)1. If g G Z[a:] satisfies g{0) = 0 and has r>2 nonzero coefficients, then 


9-1 

E 

s=0 
(s.9) = l 


p2Trig{s)/q 


< gcd(cont( 5 ),g)i/’-gi 


— 1/r 


Lemmas IA.3I and IA.5I indicate that we could potentially lose control of the Gauss sums if the coefficients 
of the auxiliary polynomials gain larger and larger common factors during the iterations, but the following 
observation of Lucier asserts that this is not case. 


Lemma A.6 (Lemma 28, [13)1. If g G Z[a:] is an intersective polynomial of degree j and g‘^ are the auxiliary 
polynomials as defined in Section d] then for every d G N, 

cont(5‘^) < |A(5)|('^"^)/^cont(5), 

where A{g) = ~ if g factors over the complex numbers as 

g{x) = a{x — aiY^ .. .(x — Or)®’’ 


with all the Ui’s distinct. 


The following standard result combines with Lemmas lA.ll and IA.2I to allow us to gain additional savings 
from the trivial bound on the Weyl sums close, but not too close, to rationals with very small denominator. 
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Lemma A.7 (Van der Corput’s Lemma). If j > I and X > 0, then 

rX 


2'K\x^ ^ 


da; 


<m 


-i/i 


Finally, we invoke a variation of the most traditional minor arc estimate, Weyl’s Inequality, to get the desired 
estimates far from rationals with remotely small denominator. 

Lemma A.8 (Lemma 3 in [3]). Suppose g{x) = oq + aia; + • • • + ajx^ with G M and aj € N. If (a, g) = 1 
and \a — a/q\ < q~‘^, then 

X 


n—1 


^27rig{n)a 


A 


(aj\og^^{ajqX){l/q+l/X + q/ajX^)'j 


2 -J 


A.l. Proof of (|23p and (|24p . We return to the setting of the proof of Lemma l4.2l recalling all assumptions, 
notation, and fixed parameters. Fixing a S T and letting Z = , the pigeonhole principle guarantees 

the existence oi 1 < q < N/Z and (a, g) = 1 with 

Z 


a 

a - 

g 


< 


qN' 

Letting P = a — a/q, we have by Lemmas lA.II and lA.21 that 


(62) 

and 


9-1 .Mi 

5,(a) = g-i(^ + O?,, (g(l + A/3)) 


9-1 


(63) Si{a) = -^( V I g2^i/i“-(a.)/3^^ OhiiMiC 4D°iog(‘.-l) (1 + /V/3)) 

((s,9),iv)=l 


for 1 < / < f, where 


n ('-;)^ 

p<^-(i3'+e) 


-(D'+e) 

Pl9 

In particular, if g < 77 “then 

9-1 .Mi 


s=0 


and 


jT/f , ^ I , V pMi , 


(f){q)Mi 


s=0 
(s,9) = l 


If g < Q and |/3| < 7 , then (1331) follows from the definition of <S'(a) by trivially bounding the above integrals 
and applying Lemmas IA.3I IA.4I IA.5I and IA. 6 I 

If g < Q and 7 < |/3| < /qN, then after trivially bounding the exponential sums in (1621) and (1331) . ((331) 
follows from Lemma IA.7I and the observation that 


(64) 


fM 

Jo 


where b/' is the leading coefficient of hf*. 

For Q < q < we have shown that (1231) holds, and in this range (I24p follows from 
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If r] < g < Z, the exponential sum in (IMl) does not collapse quite as conveniently, but by appropriately 

separating the sum and applying Lemmas IA.3I and IA.6I we still have 

9-1 


(65) 


E ' 

s=0 

{{s,q),w)=l 


(s)a/g 




l-liki+e/2i 


The deduction of this estimate is a simpler version of the proof of Lemma IB.2[ and (1241) then follows from 
trivially bounding the integrals in (I62p and (|55)) and applying (|55|) and Lemmas IA.3I and IA.6I 


Finally, if Z < g < N/Z, then (IM)) follows with room to spare from Lemma [A.81 as exhibited in Section 4 of 
[T], and the desired estimates are established in all cases. □ 


A.2. Discussion of (I48p and (|49p . The weighted exponential sum estimates (1481) and (|4^ are obtained 
by mimicking Section lA.ll applying Lemmas IA.3IIA.6l as well as a weighted version of Lemma lA.ll which 
follows analogously, and a weighted version of Lemma IA.8[ which follows from summation by parts. To 
highlight the reasoning for applying the derivative weight, we note that 


rMi 




hi'(Mi) 


g2^iy/3dy <min{L, 


Jo Jo 

yielding the last term on the right hand side of (1491) and the tolerable logarithmic accumulation in summing 
over an entire major arc, which is crucial for the rest of the argument. 


Appendix B. Exponential Sum Estimates over Shifted Primes 

In this appendix, we prove or invoke prime input analogs of the estimates in Appendix which are required 
to establish (IMl) and (1551) . We begin with the following analog of Lemma FA.II 

Lemma B.l. Let N, Q, k, K, qo, and x be as fixed in Section\^ Suppose g(x) = oo + aix H-h OjX^ € Z[a;] 

and let J = |ao| H— • + \aj\. If qo \ d, d/qo < Q, a,g G N, g < (goQ)®^, X > and a = a/q + fi, then 

(1 - xir)idxy-^)e‘^^'3^^^<^dx + 0{qX{l + JX^ 

dn-\-rprime 

where 

9-1 

g{a,q)= 

{ds-\-r,q) — l 


X 


log(dn + = 


pX 


fi{qd) 


Gia,q) 


Proof. Lemma lB. 1 I follows by mimicking the proof of Lemma lA.il replacing (IFTI) with (l30l) and the observation 
that 

X 

log(dn + r) = fii^dX + r,ds + r, qd). 

n—1 

n=s mod q 
dn-\-r prime 

□ 


In place of Lemmas IA.3IIA.5l we apply the following restricted Gauss sum estimate. 

Lemma B.2. Suppose g{x) = Oq + aix + • • • + ajx^ € Z[a;]. If d,r € h, g G N and (a, g) = 1, then 

^ <C'-(^)(gcd(cont(g),gi)gcd(a„g2)) ' 

s—0 
{ds-\-r,q) — l 

where C = C{j), q = qiq 2 , and g 2 is the maximal divisor of q which is coprime to d. 


( 66 ) 
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Proof. Fix g,d,r,a,q as in Lemma IB.2I As is often the case with this type of sum, we can simplify our 
argument by taking advantage of multiplicativity. Specifically, it is not difficult to show that if g = qiq 2 with 
(gi,g 2 ) = 1, then 


9-1 

iq _ 

s=0 

(ds-\-r.q) — l 



2Trig(si)ai/qi 



c‘iTrig{s2)a2/q2 


where afq = ai/qi + 02 / 52 , so we can assume q = p" for some p G P, u G N. If p | d and p | r, then ds + r is 
never coprime to p’', so the sum is clearly zero. If p | d and p -j' r, then ds + r is always coprime to p", so the 
sum is complete and the result follows from Lemma |A.3I If p f d, then p | ds + r if and only if s = —rd~^ 
mod p. Therefore, 



p‘^-1 

p'^-1 

p’'-i-i 

(67) 

g27ric/(s)a/p^ 

— g27ri9(s)a/p” _ 


s^O 

p\ds-\-r 

s=0 

s'=0 


g 2 iri 9 (ps'+m)a/p” 


where m = —rd ^ mod p, and by Lemma I A. 3 1 we need only obtain the estimate for the second sum. Setting 

X 9{ps + to) - g{m) 

9{s) =- - -, 

we see that p is a polynomial with integer coefficients and leading coefficient ajp^~^. In particular, 

gcd(cont(p),p’'"^) < p"~^ gcd(aj,p”"^). 

Therefore, by Lemma lA.31 we have 


.-i_i 


p''-i-l 

g2-irig(ps+m)a/p‘" 

= 

g27ri(/(ps+m)-/(m))a/p 

s=0 


s=0 





= 

g27rig(s)o/p”“^ 


s=0 

(p"-i 


as required. 


□ 


Remark. The factor of the conclusion of Lemma fB.2l arises when exploiting multiplicativity after 

decomposing q into a product of prime powers. In the published version of US], this factor is incorrectly 
absent in the corresponding Lemma 9, while the identical proof is provided. Fortunately, this oversight has 
no bearing on the main results of that paper (a corrected version has been uploaded to the arxiv server), 
but the distinction is relevant in certain cases here. 


In the case that £ = ki = 1, that is to say the case of p ± I differences or a fixed multiple thereof, we can 
evaluate the relevant local sum precisely, showing that it has magnitude at most 1. Here we restrict to the 
p — 1 case, and the p + 1 case is analogous. 


Lemma B.3. If {a,q) = 1 and d G Z, then 


E 


g2irisa/9 _ 


s=0 

(ds+l, 9 ) = l 


0 


if {d, q) 

else 


1, where j = d ^ mod q 


where p is the Mobius function. 
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Proof. Again we exploit multiplicativity, and we see from the Chinese Remainder Theorem that if (a, g) = 1 
and q = qiq 2 with ( 91 ,^ 2 ) = 1, then 

q-l qi-l 92-1 

E 

r—0 ri—0 S 2 —O 

{ds-\-l,q) — l (dsi + l,< 7 i) —1 (ds 2 + l,q 2 ) —1 

where ajq = aijqi a 2 1q 2 - Therefore, we can assume q = for p G V and v G N. If p \ then we always 
have {ds + l,p’') = 1, so the exponential sum is complete and equal to 0 by orthogonality. 

If p f d, then we can change variables in the sum setting r = ds + 1, which yields 


g27risiai/gi E g27ris2a2/92 


g27risa/9 


p"-l 


g27ri(r-l)ja/p” 


r =0 


where j = d ^ mod p", so the lemma follows from the identity 


^2mralp'" = ifw— 1 

“ 10 else 

r=0 t 

p\r 


which again follows from orthogonality. □ 


In place of Lemma |A.8[ we invoke the following prime input analog, which is a less precise, only nominally 
generalized version of Theorem 4.1 in m- 

Lemma B.4 (Lemma 4 in [3]). Suppose g{x) = oq + aia: + ••• + ajx^ £ and let J = If 

U > log A, |aj| > c{^ai-i\ + • • • + |ao|^, and |d|, |r|, |aj| < [/■', then 

^ V 

^ log(dn + Tf + 

n—l 

dn-\-r prime 


provided 

\a — a/q\<q~^ for some U'’< q < h{X)/U‘’ and {a,q) = l. 


B.l. Proof of (|34p and (|35p . We return to the setting of the proof of Lemma l5.41 recalling all assumptions, 
notation, and fixed parameters. Here we establish (l34t and (1351) by mimicking the proofs of (l23l) and (1241) . 
The details can be fleshed out by referring to section lA. 11 and also applying (l32l) when necessary. 

Fixing a £ T, the pigeonhole principle guarantees the existence oi 1 < q < L/(qoQ)^^ and (a, q) = 1 with 


a 

a - 

9 


< 


(90 Q) 


2K 


qL 


Letting fi = a — ajq, we have that if q < {qoQf^, then (l34l) follows from Lemmas IB.HIB.31 In particular, 
(IM)) holds if 9 < Q and \j3\ < 7 . 

If 9 < <5 and 7 < 1/31 < -, then (1351) follows from Lemma IBTI Lemma IAT tI and integration by parts. 

qL 

li Q < q < {qoQ)^^, then, as previously mentioned, (1341) holds, and in this range (1341) implies (l35l) . 

Finally, if {qoQ)'^^ < 9 < L/{qoQ)‘^^, then (l35l) follows from applying Lemma iBAl with U = (qoQ)^- □ 
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